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Abstract 

In Pawlak's rough set theory, a set is approximated by a pair of lower and upper approximations. To measure 
numerically the roughness of an approximation, Pawlak introduced a quantitative measure of roughness by using the 
ratio of the cardinalities of the lower and upper approximations. Although the roughness measure is effective, it has 
the drawback of not being strictly monotonic with respect to the standard ordering on partitions. Recently, some 
improvements have been made by taking into account the granularity of partitions. In this paper, we approach the 
roughness measure in an axiomatic way. After axiomatically defining roughness measure and partition measure, we 
provide a unified construction of roughness measure, called strong Pawlak roughness measure, and then explore the 
properties of this measure. We show that the improved roughness measures in the literature are special instances 
of our strong Pawlak roughness measure and introduce three more strong Pawlak roughness measures as well. The 
advantage of our axiomatic approach is that some properties of a roughness measure follow immediately as soon as 
the measure satisfies the relevant axiomatic definition. 
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1. Introduction 



Rough set theory was proposed by Pawlak in the early 1980s [22, 23] as a new mathematical approach for dealing 
with inexact, uncertain or vague knowledge in information systems. Since then we have witnessed a systematic, 
world-wide growth of interest in rough set theory 04|, [20(, [25|, [28L 1371 143|, [44|, [4-8 , 49, 501 15211 . Nowadays, it is widely 
recognized that rough set applications have a grea t importance in several fields, such as granular computing, data 
mining, and approximate reasoning [19, 26[ 27, 46, 51 , 531. 



A basic hypothesis in rough set theory is that some elements of a universe may be indiscernible in view of the 
available information about the elements. Such an indiscernibility relation was first described by equivalence relation 



in the way that two elements are related by the relation if and only if they are indiscernible from each other 11221 12311 . 
In this framework, a rough set is a formal approximation of a subset of the universe in terms of a pair of unions of 
equivalence classes which give the lower and upper approximations of the subset. In order to measure numerically the 
accuracy of an approximation, Pawlak introduced two quantitative measures of accuracy and roughness in |23ll: The 
accuracy of a subset is defined as the ratio of the cardinalities of the lower and upper approximations of the subset, 
which is bounded by and 1 ; the roughness of a subset is calculated by subtracting the accuracy of the subset from 1 . 
Therefore, roughness is a complementary concept to the accuracy of approximation. The roughness is, in some sense, 
the amount of uncertainty of the underlying subset. A roughness of 1 shows that we have no certain knowledge on 
the underlying subset, and a roughness of means we know everything for sure about the underlying subset. From 
this point of view, the roughness measure is an important indicator of the uncertainty and accuracy associated with a 
given subset. 

It has been observed J2, [l7 , 35] that the roughness (and its dual, accuracy) due to Pawlak, however, has the 
drawback of not being strictly monotonic with respect to the standard ordering on partitions. In other words, Pawlak's 
roughness and accuracy measures do not necessarily provide us with information on the uncertainty related to the 
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granularity of partitions. To modify such measures, Beaubouef, Petry, and Arora proposed the notion of rough entropy 
in 10]. This limitation has also been improved by Xu, Zhou, and Lu in l35ll by using the so-called excess entropy 
approach. Most recently, Liang, Wang, and Qian made another improvement by exploiting the notion of knowledge 
granulation in [17]. All of these improvements share some good properties. Nevertheless, there exists no unified 
description for roughness measure. 

The purpose of this paper is to investigate roughness measure in an axiomatic way. We first introduce an ax- 
iomatic definition of roughness measure by taking into account the common properties of the roughness measures 
from [351 and 11711 . After giving an axiomatic definition of partition measure, we then provide a unified construction 
of roughness measure, called strong Pawlak roughness measure, by combining partition measure into Pawlak's rough- 
ness measure. Some properties of the strong Pawlak roughness measure are examined in detail. Finally, we show that 
the existing roughness measures in ll35ll and lfl7ll are two special instances of our strong Pawlak roughness measure 
and present three new strong Pawlak roughness measures as well. The advantage of our axiomatic approach is that 
some properties of a roughness measure follow immediately as soon as the measure satisfies the relevant axiomatic 
definition. 

The remainder of the paper is structured as follows. In Section 2, we briefly review some basics of Pawlak's 
rough set theory and the roughness measures in the literature. The axiomatic definitions of roughness measure and 
partition measure are given in Section 3. The strong Pawlak roughness measure and its properties are also provided 
in this section. Section 4 is devoted to the case study of strong Pawlak roughness measures. We conclude the paper in 
Section 5 with a brief discussion on the future research. 



2. Preliminaries 



This section consists of four subsections. We recall the definition of Pawlak's rough sets in Section 2.1. Sections 
2.2, 2.3, and 2.4 are devoted to roughness measures proposed by Pawlak J23}], Xu et al. ||35ll . and Liang et al. lfl7ll . 
respectively. Some necessary properties of these measures are collected for later use. 



2.1. Rough sets 

We start by recalling some basic notions in Pawlak's rough set theory 11221 12311 . 

Let U be a finite and nonempty universal set, and let R c U x U be an equivalence relation on U. Denote by U /R 
the set of all equivalence classes induced by R. Such equivalence classes are also called elementary sets; every union 
(not necessarily nonempty) of elementary sets is called a definable set. For any A QU, one can characterize A by a 
pair of lower and upper approximations. The lower approximation R,(A) of A is defined as the greatest definable set 
contained in A, while the upper approximation R*(A) of A is defined as the least definable set containing A. Formally, 



R t (A) = U{C € U/R | C c A) and R*(A) = U{C € U/R \ C n A + 0}. 



It follows immediately from definition that RJA) c A c R*(A) for any A c JJ. In particular, when R»(A) - A - R*(A), 
the set A is also called R-exact. Clearly, every definable set is R-exact, and vice versa. The ordered pair (U,R) is 
said to be an approximation space. A rough set in {U, R) is the family of subsets of U with the same lower and upper 
approximations. 

Recall that a partition of U is a collection of nonempty subsets of U such that every element x in U is in exactly 
one of these subsets. We write Yl(U) for the set of all partitions of U and ^{U) for the power set of U. It is well-known 
that the notions of partition and equivalence relation are essentially equivalent, that is, for any equivalence relation 
R on U, the set U/R is a partition of U, and conversely, from any partition n of U, one can define an equivalence 
relation R„ on U such that U/R„ = n in the obvious way. Thus, we sometimes say that the ordered pair (U, n) is 
an approximation space and write n„{A) and n*{A) for R„„(A) and R/iA), respectively. More generally, we will use 
equivalence relation and partition indiscriminately. 

Whatever be a nonempty universe U, it is always possible to introduce at least two canonical partitions: One is the 
trivial partition, denoted by ft, consisting of a unique equivalence class, and the other is the discrete partition, denoted 
by ft, consisting of all singletons from U. Formally, 



tt = {t/} and tt = |xe U}. 
2 



We now define a partial order "<" on TL(U): For any n, cr e Yl(U), 



n < cr <=> for any C en, there exists D e cr such that CCD. 

For instance, 7r < 7r < ir for any 7r € Tl(U). We say that n is finer than cr and that cr is coarser than 7r if 7r < <x. When 
7r -< cr, that is, n < cr and n + cr, we say that 7r is strictly finer than cr and that cr is strictly coarser than 7r. Informally, 
this means that n is a further fragmentation of cr. 

2.2. Roughness measure by Pawlak 

Let (U,n) be an approximation space. To characterize the uncertainty of rough sets, Pawlak proposed two nu- 
merical measures: roughness and accuracy (see, for example, B23IP . The accuracy reflects the degree of completeness 
of knowledge about a given subset A of U; it is defined by the ratio of the cardinalities of the lower and upper 
approximations. More formally, the accuracy of A (with respect to n) is defined by 

a pin, A) = 

\n*(A)\ 

where A + and "|5 1" denotes the cardinality of a set S . For convenience, we set ap(n, 0) = 1, that is, |0|/|0| = 1. 

The roughness is opposed to the accuracy; it represents the degree of incompleteness of knowledge about a given 
subset A. The roughness Bp(n,A) of A (with respect to n) is calculated by subtracting the accuracy ap(n, A) from 1, 
that is, 

B P (n,A) = 1 -a P {n,A). 

Clearly, < B P {n,A) < 1. It is easy to see that B P (n,A) = 1 if and only if n*{A) = and n*(A) + 0, and 
j3p{n,A) — if and only if A is ;r-exact. 

To state the next property, we need one more notation. By B P (n, ■) < /3p(cr, •) we mean that B P {n, A) < f3p(cr, A) for 
every A c U and fip(n, B) + Bpicr, B) for some B c U. Similar usages of "<" will appear in the subsequent sections. 
The following property follows from the fact that for any A c U, cr„(A) C n t (A) c A c n*(A) c cr*(A) if n < cr. 

Property 1. For any n, cr e Tl(U), if n < cr, then B P (n, ■) < B P (cr, •). 

The roughness and accuracy measures are of simple expressions and are suitable for evaluating the uncertainty 
arising from the boundary region. Nevertheless, neither the roughness nor the accuracy reflects the granularity of the 



underlying partition. This limitation has been pointed out by several researchers J2, 17, 35]. For the convenience of 
the reader, we record a simple example. 

Example 1. Let U — {oi, aq_, 03,04, as), n — {{«i}, {02,03}, {04, 05}}, and cr — {{a\, aq_, 03}, {04, 05}}. Then it is easy to 
see that ti and cr are partitions of U , andmoreover, n < cr. Assume that A — {a\, ai, 03, 04}. We thus have by definition 
that 

7i t (A) — cr t (A) — {01,02,123} andn*(A) — o~*(A) — {01,02,03,04,05} = U. 

Therefore, ap(n,A) — ap(p,A) — 0.6 and Bp(n,A) — Bp(p,A) — 0.4. It means that partitions with different granu- 
lations may give rise to the same accuracy (and thus roughness) for some subsets. This property is, of course, not 
desirable. 



2.3. Roughness measure by Xu et al. 

To improve the accuracy and roughness measures suggested by Pawlak, Xu et al. rf35ll combined the granularity of 
the underlying partition into Pawlak's accuracy by exploiting the so-called equivalence relation graphs. To introduce 
their definition, we need several notions. 

Let {U,n) be an approximation space. The equivalence relation graph with respect to n is defined as G(n) = 
(N{n), E(n)), where N(tt) = U and E{n) = {(x,y) eUxU\(x,y)e R K }. For any vertex v on G(n) = (N(tt), E(tt)), let the 
subgraph corresponding to v be G(n) (v) = (N(tt) (v \ E(tt) (v) ), where A^(7r) (l,) = N(n) and E(tt) {v) = {(v,y) | (v,y) e E(n)}. 
For any given G{n), let Pjjg^ v) be the proportion of the row vector L(G(n), v) out of |iV(7r)| row vectors in the incidence 
matrix of G(n), where L(G(n), v) is the row vector corresponding to v in the incidence matrix of G(n). More formally, 
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PuGin) v) ~ w /l^( 7r )l' where n is the number of row vectors in the incidence matrix of G{n) that are equal to L(G(n), v). 
For an equivalence relation graph G(n), the minimum description length of G(n) is defined b>Q 

I(G(n)) = - J] logpgg^. 

veJV(Tr) 



Finally, we may state the key definition and results in [35] as follows. 

Let (U, n) be an approximation space. The roughness of A (with respect to n) due to Xu et al. is defined by 

con(G(n\) 

B x (n,A)=B P {n,A)- 



|I/|(|I/|-l)log|C/|' 
where Bp(n, A) is the Pawlak's roughness of A and 

con{G{n))= J] I{G{nf v) ) - I{G{n)). 

Note that 7(G(7r) (v) ) is the minimum description length of the subgraph G(7r) (y) corresponding to v. 

It follows from [[35] that the new roughness measure B x (tt, A) enjoys some useful properties, which are listed as 
follows. 

Property 2. For any n e U(U) and A c U, < B x (n,A) < 1. 

Property 3. B x {n, A) = for any A c U; B x (n, U) = 0, and /3 x (tt,A) = 1 for any A CU. 
Property 4. For any n, cr e II(f/), if w < <x, then f3 x (n,A) < /3 x (cr,A) for any A c C/. 

Property 5. Let n e 11(17) and A, B c U. 

1) If ff.(A) = 7T«(B), then yS z (7r, A flB)< min{/3 x (7r, A),p x (n, B)}. 

2) If tt*(A) = n*{B), then y3 z (7r, A UB)< mm{B x (n,A),B x {n, B)}. 

It should be pointed out that B x has not the same drawback as shown for Bp in Example Q] 

2.4. Roughness measure by Liang et al. 

Based on the notion of knowledge granulation due to Miao and Fan [21], Liang et al. proposed a rough- 
ness measure lfl7ll which is simpler than that in ll3~5ll . Let {U, n) be an approximation space and suppose that 
n = \C\ , C2, . . . , C m }. The roughness of A (with respect to n) due to Liang et al. is defined by 

\u\ 

where Bp{n,A) = 1 - |7r,(A)|/|7r*(A)| is the Pawlak's roughness of A and the second term '7', 1 represents the 
knowledge granulation of n introduced in 112 ID . 

The following properties of the roughness measure Bl{tt,A) were given in II 1711 . 

Property 6. For any n <eH(U) and A c £/, < B L (n,A) < 1. 

Property 7. If B L (n,A) = for all A c [7, then 7r = ir; if yS L (7r, A) = 1 for some A + U, then n = fc. 
Property 8. For any n, cr 6 U(U), if n <cr, then B L {n,A) < B L (cr,A) for any A c U. 

Like /Jx, the roughness measure Bl has not the same drawback as shown for Bp in ExampleQ] 



All logarithms are to base 2 unless otherwise specified. 
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3. An axiomatic definition of roughness measure 

This section is composed of three subsections. Motivated by the roughness measures in lfl7l l35ll . we present 



an axiomatic definition of roughness measure and discuss its basic properties in Section 3.1. In order to construct 
more roughness measures, we introduce an axiomatic definition of partition measure in Section 3.2. By incorporating 
partition measure into Pawlak's roughness measure, we introduce a strong Pawlak roughness measure and explore its 
properties in Section 3.3. 

3.1. Roughness measure 

For later need, let us introduce the following notion which relates two approximation spaces. 

Definition 1. Let (U, n) and (V, cr) be two approximation spaces, and suppose that f : U — > V is a mapping. 



1) The mapping f is called a homomorphism/rom (U, n) to (V, cr) if for any C e n, there exists D € cr such that 
f (C) CD, where f(Q = {f(u)\u€C}. 



2) A homomorphism f is called a monomorphism ;// is an injective mapping. 

3) A monomorphism f is called strictly monomorphic if there exist C € n and D e cr such that /(C) C D, namely, 



4) The mapping f is called an isomorphism if the mapping f : U — > V is bijective, and moreover, both f and its 
inverse mapping / _1 are homomorphisms. 

Let us now present the axiomatic definition of roughness measure. 

Definition 2. Let U be a finite and nonempty universal set and f3 a mapping from 11(1/) x &(U) to the closed unit 
interval [0, 1]. We say that (3 is a roughness measure on U if the following conditions are satisfied: 

1) f3(n,A) — if and only if A is n-exact. 

2) For any n,cr e Tl(U), if n < cr, then /3(n, •) < /3(cr, ■)• 

3) For any n,o~ e Tl(U), if there is an isomorphism f from (U,n) to {U,cr), then f3(n,A) — {3(o~, f(A)) for any 



If (3 is a roughness measure on U, then the roughness of A (with respect to n) is defined by the value (3(n,A). 

Let us give a brief, informal account of the above conditions. Condition 1) just says that a set is exact if and only if 
it has roughness 0. Condition 2) requires that roughness measure strictly maintains the partial order on Tl{U). Notice 
that any isomorphism / from (U, n) to (U, cr) is actually a renaming of elements of U that keeps elementary sets. For 
example, there is an isomorphism between ({1,2, 3,4}, {{1,2}, {3,4}}) and ({1, 2, 3,4}, {{1, 3}, {2,4}}). Therefore, the 
condition 3) requires that roughness measure is only dependent on the structure (i.e., blocks) of partitions; this seems 
quite reasonable. 

It follows from the definition of Pawlak's roughness measure f3p and Property[T]that f3p is a roughness measure in 
the sense of Definition [2] To illustrate the definition, let us examine a trivial example. 

Example 2. Consider f3 : FI(L0 x £?(U) — > [0, 1] defined as follows: 



Clearly, the condition 1 ) in Definition\2\is satisfied. For the condition 2), let n,cr e TL(U) with n < cr. For any A QU , 
it follows from definition that A is n-exact whenever it is cr-exact, but the converse does not hold, that is, a n-exact 
set may not be cr-exact. It means that if/3(cr,A) — 0, then f3(n,A) — 0, and there exists A' C U such that f3(n,A') — 
while f3(cr,A') — 1. This forces that f3(n,-) < f3(cr,-), as desired. For the condition 3), note that the isomorphism 
f establishes a one-to-one correspondence between the set of n-exact sets and that of cr-exact ones. Therefore, the 
condition 3) holds, and (3 is indeed a roughness measure on U. 

By definition, we have two properties of roughness measure. The first one is a characterization of the minimum 
of a roughness measure. 



f(C)QDandf(C) + D. 



AQU. 




0, if A is n-exact 

1, otherwise. 
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Proposition 1. Let (3 be a roughness measure on U. Then f3(n,A) — holds for all A C U if and only if n — ft. 

Proof. Note that if n — ft, then every subset A of U is 7r-exact. Hence, the sufficiency follows immediately from 
Definition[2] Conversely, if /3(n,A) = holds for all A C U, then by definition every subset A of U is 7r-exact. This 
means that every nonempty subset of U is a definable set. As a result, we see that n — ft, which proves the necessity. 

The next property is a relaxation of the condition 2) in Definition[2] 

Proposition 2. Let ft be a roughness measure on U and n,cr e Tl( U). If n < cr, then f3(n, A) < f3(o~, A) for any A c U. 
In particular, f3(n,A) < f3(n,A)for any n e Yl(U) and A C U. 

Proof. It follows directly from the condition 2) in Definition[2] 

The following property is equivalent to the condition 2) in Definition[2l under the condition 3) in this definition. 

Proposition 3. Suppose that f3 is a roughness measure on U and f is a strict monomorphism from (U, n) to (U, cr). 
Then j3{n, A) < f3(cr, f{A))for any A c U, and moreover, there exists A' c U such that [3{n, A') < f3(a, f(A')). 

Proof. Since / is a monomorphism from (U, n) to (U,cr), it gives an isomorphism between (U,n) and (U,f(n)), 
where f(n) = {/(A) | A € n}. Note that U is finite and / is injective, so / is bijective and thus f(n) is indeed a partition 
of U. By definition, we see that /3{tt,A) < f3{f{n),f(A)) for any A c U. On the other hand, we have that f(n) < cr 
since / is strictly monomorphic. This means by Proposition [2] that [3( f{n),f(A)) < /3{cr,f(A)) for any A c U. As a 
result, /3{tt, A) < /3{cr, /(A)) for any A c U. The remainder of this proposition follows easily from the strictness of the 
monomorphism /. 

The condition 2) in Definition [2] just says that (3 is strictly monotonic. Depending on applications, strict mono- 
tonicity may not be so required. For example, it is usually interesting to look at dependencies between partitions 
generated by decision attributes and condition attributes in a decision system. In such cases, one may be mostly in- 
terested in weak monotonicity but not in strict monotonicity. Actually, the cases when the measure does not change 
while changing the partition into a more or less detailed one are of special importance for feature selection, feature 
subset selection, feature extraction, and feature reduction in knowledge discovery (see, for example, 
view of this, let us introduce a weak version of Definition [2] as follows. 

Definition 3. Let U be a finite and nonempty universal set and f} a mapping from TMJJ) X {^(U) to the closed unit 
interval [0, 1]. We say that /3 is a weak roughness measure on U if the following conditions are satisfied: 

1) /3(tt,A) = if and only if A is n-exact. 

2) For any n,cr e n(C7), if it < cr, then /3(n, •) < f3(cr, ■). 

3) For any n, cr e Tl(U), if there is an isomorphism f from (U,n) to (U,o~), then /3(tt,A) — /3(cr, /(A)) for any 
AQU. 

4) p(n, ■) = if and only if n — ft. 

Iff} is a weak roughness measure on U, then the weak roughness of A ( with respect to n) is defined by the value p(n, A). 

Clearly, Conditions 1) and 3) are the same as in Definition[2] and Condition 2) means that (3 is weakly monotonic. 
Note that in order to avoid /3 being constant, we add Condition 4) in the above definition. It is easy to check that 
any roughness measure in the sense of Definition [2] is a weak roughness measure, but the converse does not hold in 
general. 

3.2. Partition measure 

In order to measure partitions, we provide an axiomatic definition of partition measure as follows. 

Definition 4. Let U be a finite and nonempty universal set and h a mapping from Tl(U) to [0, +oo), the set ofnonneg- 
ative real numbers. We say that h is a partition measure on U if the following conditions are satisfied: 

1) For any n, cr e Tl(U), ifn < cr, then h(n) < h(o~). 
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2) For any n,o~ e Tl(U), if there is an isomorphism f from (U, n) to (U, cr), then h(n) — h(cr). 

Intuitively, we require that partition measures on U are only dependent upon the structure of partitions, not the 
names of elements in U. Roughly speaking, the greater the value of h, the coarser the corresponding partition. Let us 
see an example. 

Example 3. Let U — { 1 , 2, 3, 4}. Then U has 15 partitions because the total number of partitions of an n-element set 
is the Bell number B n , recursively defined by B n+ \ — Yjk=o(t)Bi< and Bq — 1 (see, for example, [5]). For simplicity, we 
write 1/2/34 for the partition {{1}, {2}, {3,4}}, alike to other partitions. With this notation, we have that 

11(1/) = {1234,1/234,2/134,3/124,4/123,14/23,13/24,12/34, 

1/2/34,1/3/24,1/4/23,3/4/12,2/4/13,2/3/14,1/2/3/4}. 
By definition, any mapping h : Yl(U) — > [0, +°o) that satisfies the following conditions is a partition measure on U: 

Ml/2/3/4) = n , 

A(l/2/34) = A(l/3/24) = A(l/4/23) = A(3/4/12) 

= A(2/4/13) = K2/3/14) = r 2 , 
A(14/23) = A(13/24) = A(12/34) = r 3 , 
A(l/234) = A(2/134) = A(3/124) = A(4/123) = r 4 , 
A(1234) = r 5 , 

where r { e [0, +°°), i — 1, 2, . . . , 5, with r\ < ri < r^ < r$ and r\ < r-i < rn, < r$. 

We remark that our axiomatic definition of partition measure is essentially based on the cardinalities of all equiv- 
alence classes in a partition. Recently, Yao and Zhao [41] have directly established a partition measure on the car- 
dinality of a partition, and moreover, they constructed an interesting measure of the granularity of a partition which 
has several existing measures as instances. We see by Theorem 3 in [41] that this new measure satisfies Definition!?] 
as well. It should be stressed that constructing and evaluating partitions are the most basic issues in rough set theory, 
since the indiscernibility is the mathematical basis of rough set the ory 12511 and there are strong relationships between 
indiscernibility measures and partition measures QjJ, [2jJ, [32|, [39|, |40|, |45[] . In addition to rough sets, the granularity of a 
partition is a very important concept in many other fields such as information theory, data mining, machine learning, 
and pattern recognition. In the literature, there are a lar ge n umber of approaches to measuring partitions (see, for 
example, JlHlSlillilllllilEElillliS Sll H 0, US E3l). In some sense, the roughness 
measure in Definition [2] as well as the weak roughness measure in Definition [3] can be viewed as a partition measure 
because for some given subsets, say A, of U, the function B(-,A) can reflect the granularity of a partition. 

The following facts follow directly from Definition |4] 

Corollary 1. Suppose that h is a partition measure on U. 

1) For any n, cr e Tl(U), ifn < cr, then h(n) < h(cr). 

2) For any n € Yl(U), h(n) < h(n) < h(n). In particular, h(n) > whenever n + n. 

As expected, partition measures have the following property. 

Proposition 4. Let h be a partition measure on U and n,o~ e Tl(U). If there is a strict monomorphism f from (U, n) 
to {U, cr), then h(n) < h(cr). 

Proof. Assume that / is a strict monomorphism from {U, n) to {U, cr). Then it is clear that / gives rise to an iso- 
morphism from (U,k) to (U,f(n)), where f{n) = {/(A) | A € n}. We thus have that h(n) = h(f(n)) by Definition!!] 
Furthermore, we find that f(n) < cr since / is strictly monomorphic. By definition, we obtain that h( f{n)) < h(cr). 
Consequently, h{ii) < h(cr), finishing the proof. 

As a corollary of Proposition |4] we get an equivalent definition of partition measure. 

Corollary 2. A mapping h : Tl(U) — > [0, +oo) is a partition measure on U if and only if the following conditions 
hold: 

1) For any n, cr e H(U), if there is a strict monomorphism f from (U, n) to {U, cr), then h(n) < h(cr). 

2) For any n,o~ e Tl(U), if there is an isomorphism f from (U, n) to (U, cr), then h(n) — h(o~). 
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3.3. Strong Pawlak roughness measure 

Given a partition measure, we can construct a roughness measure in the sense of Definition [2] as follows. 



Theorem 1. Let U be a finite and nonempty universal set. Suppose that h is a partition measure on U. Then the 
function j3 h :U(U)x &>{JT) — > [0, 1] defined by 



is a roughness measure on U. 

Proof. We need to check all the three conditions in Definition [2] 

For the first condition, observe that f3 h (7T,A) = if and only if either jp^j = 1 or h{n) - 0. Clearly, = 1 is 
equivalent to that A is 7r-exact. Note that h{n) — implies n — n by Corollary Q] In this case, every subset A of U is 
7r-exact. Hence, /3/,(7r, A) = if and only if A is 7r-exact, as desired. 

For the second condition, assume that n < cr. Then h(n) < h(cr) by definition. On the other hand, we have by 
Property [T] that f3 P (n, ■) < f3p(cr, ■)■ We thus obtain that f3 h (n, ■) < fi h (cr, ■), as desired. 

For the third condition, suppose that / is an isomorphism from (U, n) to (U,cr), where n, cr e FL(U). Then by 
definition we see that \n,(A)\ = |o%(/(A))| and \n*(A)\ = |cr*(/(A))| for any AQU. Therefore, /3p(n, A) = /3 P (cr, /(A)). 
Since / is an isomorphism, we also have that h(n) = h(cr) by definition. As a result, we get that f3h{n,A) = f3h(cr, f(A)) 
for any AQU. This completes the proof of the theorem. 

For convenience, the roughness measure fii, associated to a partition measure h is called a strong Pawlak roughness 
measure. Note that neither Pawlak's roughness measure ftp nor the roughness measure defined in Example|2]is a strong 
Pawlak roughness measure. We end this subsection with a discussion on the properties of strong Pawlak roughness 
measures. The first five properties follow immediately from TheoremQ] in which the assertion 3) justifies the modifier 
"strong". 

Corollary 3. 

1) /?/,(7T, A) — if and only if A is n-exact. 

2) For any n,cr € Tl(U), if n < cr, then f3h(n, •) S Ph(o~, ")• Consequently, if n < cr, then j3\i(n,K) < j3h(cr,A) for any 
AQU. 

3) For any n, cr e Yl(U), if n < cr and A c U is not cr-exact, then f3h(7T,A) < j3i,(cr,A). 

4) For any n,cr e Tl(U), if there is an isomorphism f from (U,n) to (U,o~), then f3h(n,A) — j3h(cr, f(A)) for any 



5) Assume that there is a strict monomorphism f from (U, n) to (U, cr). Then f3h(n, A) < f3h(cr, f(A))for any AQU, 
and moreover, there exists A' Q U such that f3h(7T,A') < J3h(cr, f(A')). 

Like the previous roughness measures f3p, fix, and fj^, the strong Pawlak roughness measure f3h is bounded as well. 
Proposition 5. For any n e Tl(U) and A Q U, < f3h(n,A) < 1. 
Proof. It follows directly from < f3p{n,A) < 1 and Corollary Q] 

In terms of the maximum and minimum of /?/,, we have further characterizations. 
Proposition 6. f3h(n,A) = holds for all A Q U if and only ifn = n. 

Proof. By the assertion 1) in Corollary [3] f3h{n,A) = holds for all A Q U if and only if every subset A of U is 
7r-exact, which is equivalent to that n — n. This proves the proposition. 

Proposition 7. If there exists AQU such that j3i t (7T,A) — I, then jt — it. Moreover, 




AQU. 
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Proof. If there exists A c U such that fihin, A) = 1, then it forces that Bp(n,A) = 1 and h(n) = h(n). The latter yields 
that n — n by CorollaryQ] The remainder of this proposition follows readily from the definition of B/,. 

Like the roughness measure Bx, the strong Pawlak roughness measure B] x has the following properties. 
Proposition 8. Let n e Tl( U) and A, B c U. 

1) I/ttAA) = n„(B), then B h (n,A n B) < n\m{B h {n,A),B h (n,B)}. 

2) Ifn*{A) = 7T*(B), then B h (n,A UB)< mm{B h (7T,A),B h (n,B)}. 

Proof. 1) Clearly, n*(A n Z?) c 7r*(A). If jt„(A) = 7r*(Z?), then we can verify that 7r»(A n Z?) = ;r»(A). Hence, 

n B)\ > \n.(A)\ 
\K'(AnB)\ - |b»(A)|' 

which means that Bp(n,A n B) < Bp(n,A). In the same way, we obtain that Bp(n,A C\ B) < Bp(n,B). It yields that 
B h (n,A DB) < mm{B h {n,A),B h (n,B)}, as desired. 

2) The proof is similar to that of 1). It is easy to see that n t (A) c n,(A U B). If n*{A) = n*{B), then we can check 
that 7r*(A U B) = 7T*(A). As a result, we have that 

|tt*(A U g)| MA)| 
br*(AUB)| - k*(A)f 

Therefore,j8/»(^,AUB) < B P {n,A). Analogously, we can obtain that Bpiji, A UB) < Bpiji, B). It gives that B^iii, A UZ?) < 
min{y6/,(^, A),y6/,(7r, Z?)}, finishing the proof of the proposition. 



4. Case study of strong Pawlak roughness measures 

As we have seen in the previous section, if one can show that h is a partition measure on U, then /?/, is a strong 
Pawlak roughness measure and thus has the properties stated in Corollary [3] and Propositions [5] [6j [7j and [8] In this 
section, we first look at the roughness measures Bx and Bi due to Xu et al. B35I1 and Liang et al. B17I1 . respectively, in 
the framework of strong Pawlak roughness measures, and then provide three new strong Pawlak roughness measures. 

Proposition 9. The roughness measure Bx due to Xu et al. is a strong Pawlak roughness measure. 
Proof. By definition, 

con(G{n)) 

B x {n,A) = B P (n,A)- 



|f/|(|f/|-l)log|f/|' 
so it is sufficient to show that h defined by 

con{G{n)) 



h{n) 



|f/|(|f/|-l)log|f/| 



is a partition measure on U and h(n) = 1. It follows from Theorem 7 in H35H that h satisfies the condition 1) in 
Definition[4] that is, ji < cr implies h(n) < h(cr). In addition, if there is an isomorphism from (U, n) to (U, &), then 
we see that con(G(n)) = con(G(cr)) by the definition of equivalence relation graph. Therefore, h{n) = h(cr). It follows 
from definition that con(G(n)) = \U\(\U\ - 1) log \ U\, namely, h(n) - 1. Hence, h is a partition measure, as desired. In 
fact, we may also define h'(n) = con(G(n)) and then verify that h' is a partition measure and h' (n) = |C/|(|C/| — l)log|I/|. 

Proposition 10. The roughness measure Bi due to Liang et al. is a strong Pawlak roughness measure. 

Proof. By definition, 

y m \c\ 2 

B L (7i,A)=B P (n,A)- ^^ , 
\U\ 

where \C\, C2, ■ ■ ■ , C m } = n. In order to show that Bi is a strong Pawlak roughness measure, it suffices to prove that 
h defined by h(n) = Yn=\ IC;| 2 is a partition measure on U and h(n) = |t/| 2 . The former follows readily from the 
definition of h and the latter is obvious. Therefore,/?/, is a strong Pawlak roughness measure. 
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Remark 1. Propositions[9]and[T0]tell us that the roughness measures Px and Pi have the properties stated in Corollary 
[3]and Propositions[3J|6]|ZJ and|8j some of these properties are missing in 1351 or [l7\\ . 



To introduce another strong Pawlak roughness measure, let us recall the notion of co-entropy QBE ESS. 
Assume that n = {C\,C2, ■ • ■ , C m ] e Tl(U). Then the co-entropy of partition n is defined as 



. m 

E(n) = — £|Q|log|Q|. 



It has been known that the following identity holds 

H(n) + E(n) = log \U\ for any n e II(t/), 
where H(n) is the entropy of partition n EES [34L [39h defined as 



H(n) 



y IQI |Q| 

4-i \u\ og \uy 



Notice that a standard result of information theory assures the strict anti-monotonicity of entropy (see, for example, 

if n < cr, then H(cr) < H(n). 

Therefore, we have the following strict monotonicity of co-entropy with respect to the partition ordering (a direct 
proof of this result can be found in lfl6l0 : 

if n < cr, then E(n) < E{cr). 



This result has been proven in a roughness monotonicity theorem of [34], which is based on a lemma of the same 
paper. We thus find that £ is a partition measure on U. Noting that E(n) — log \U\, we get the following proposition. 



Proposition 11. Define 



/3 E {n,A)=l3p{n,A)- 



E(n) 



\og\U\ 

for any n e Tl(U) and A C U. Then (3e is a strong Pawlak roughness measure. 

Let us remark that f3p{n,A) ■ E(n) was defined as the rough entropy of A in |0]. Therefore, we may view Pe{ii,A) 
as a standardization of the rough entropy of A. 

In y|], Bianucci et al. introduced a pseudo co-entropy related to a partition n = {Ci, C2, . . . , C m ) e II(J/) as 
follows: 



1 

E\n)= IQI 2 log 



It has been proven in Jit] that 



if n < cr, then E'(n) < E'(cr). 



This property, together with the definition of E', implies that E' is also a partition measure on U. By using E'(n) = 
\U\ log \U\, we obtain the following proposition. 



Proposition 12. Define 



p E .(n,A) = p P (n,A) ■ 



E'Oi) 



\U\log\U\ 

for any n e Tl(U) and A C U. Then Pe> is a strong Pawlak roughness measure. 
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We end this section with one more strong Pawlak roughness measure arising from the concept of combination 
granulation introduced by Qian and Liang in [29]. 

Let Ti = {C\ , C2, ■ ■ ■ , C,,,} 6 Tl(U). Then the combination granulation of n, denoted by CG(n), is defined as 1E9I1 



\£l_ jQj - 1 
\U\ 2 ' \U\-V 



It has been shown by Proposition 9 in i29ll that 

if n < o-, then CG(n) < CG(cf). 

This property, together with the definition of CG, implies that CG is a partition measure on U. As CG(n) — 1, we 
obtain the following proposition. 

Proposition 13. Define 

f3cG(n,A)=f3 P (7T,A)-CG(n) 
for any n e Tl(U) and A C JJ. Then Bqc is a strong Pawlak roughness measure. 

Remark 2. Thanks to the axiomatic approach, Propositions fTTl fT2l andQ~3]show us that the roughness measures Be, 
Pe 1 , and Pec are of the properties stated in Corollary [3] and Propositions |5J|6] 13 and[8] 

Let us calculate the above five strong Pawlak roughness measures for the sets and partitions in Example Q] 

Example 4. Let us revisit Example\l\ where U — {01,02,03, a\, 0$}, n — {{a\}, {02, 03), {04, 05}), and cr — {{a\, 02, a^}, 
{«4, as}}. For A — {a\,a2, 03, 04}, we have already obtained that /3p(n,A) — /3p(p,A) — 0.4 in Example\l\ 
By a direct computation, we can readily get the following results: 



B x (n,A) = 


0.102 


< 


Mo-, A) = 


0.219; 


Bl(tt,A) = 


0.144 


< 


Mo-, A) = 


0.208; 


B E (n,A) = 


0.138 


< 


Mo; A) = 


0.233; 


B E -(n,A) = 


0.055 


< 


B E ,{o-,A) = 


= 0.126; 


Bcc(n,A) = 


0.032 


< 


Bcc(cr,A) 


= 0.088. 



All of these are consistent with the fact that n < cr. 



5. Conclusion 

In this paper, we have investigated roughness measure in an axiomatic way. The axiomatic definitions of roughness 
measure and partition measure have been provided. Based on this, we have given a unified construction of roughness 
measure, called strong Pawlak roughness measure, by combining partition measure into Pawlak's roughness measure. 
Some properties of the strong Pawlak roughness measure have been explored as well. In addition, we have shown that 
the existing roughness measures in B35I1 and B17I1 are two special instances of our strong Pawlak roughness measure, 
which supports our axiomatic definitions. The advantage of axiomatic approach is that it can bring together some 
seemingly different notions. As a result, we may study the properties of roughness measure in Definition [2] and weak 
roughness measure in Definition[3]instead of some specific measure. 

As mentioned earlier, our axiomatic definition of roughness measure is largely dependent on partition measure, 
while the latter is based upon the cardinalities of all equivalence classes in a partition. Because partition measures 
can be defined from different perspectives, more axiomatic definitions and properties of roughness measure remain 
to be investigated. In addition, the present work focuses on the classical rough sets based on partitions. It would be 
interesting to examine our axiomatic approach in the framework of covering rough sets [6, 28l l43ll or fuzzy rough sets 
{§1]. Finally, in the face of so many roughness measures, the criterion for choice remains yet to be addressed. 
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